A Simple Reconstruction of GPSG

نویسنده

  • Stuart M. Shieber
چکیده

Like most linguistic theories, the theory of generalized phrase structure grammar (GPSG) has described language axiomatically, that is, as a set of universal and language-specific conattaints on the well-formedncss of linguistic elements of some sort. The coverage atttl detailed analysis of English grammar in the ambitious recent volume by Gazdar, Klein, Pullum, and Sag entitled Generalized Phrase Structure Grammar [2] are impressive, in part because of the complexity of the axiomatic system developed by the authors. In this paper, we examine the possibility that simpler descriptions of the same theory can be achieved through a slightly different, albeit still axiomatic, method. Rather than characterize the well-formed trees directly, we progress in two stages by procedurally characterizing the well-formedness axioms themselves, which in turn characterize the trees. 1 I n t r o d u c t i o n I Like most llngafistic theories, the theory of generalized phrase structure grammar (GPSG) has described language axiomatically, that is, as a set of universal and language-specific constraints on the we[l-formedncss of linguistic elements of some sort. In the case of GPSG, these elements are trees whose nodes are themselves structured entltics from a domain of categories (a type of feature ~trueture [6]). The proposed axioms have become quite complex, culminating in the ambitious recent volume by Gazdar, Klein, Pullum, and Sag entitled Generalized Phrase Structure Grammar [2]. The coverage and detailed analysis of English grammar in this work are impressive, in part because of the complexity of the axiomatic system developed by the a u t h o r . In this paper, we examine the possibility that simpler descriptions of the same theory can be achieved through a slightly different, albeit still axiomatic, method. Rather than characterize the well-formed trees driectly, we progress in two stages by procedurally characterizing tim well-formedaess axioms themselves, which in turn charaetei'ize the trees. In particular, we give a procedure which converts GPSG g ramma~ into g r amma~ written lThls research was m~de possible by a gift. from the System Development Foundation. I am indebted to Lauri K~rttuncn and Ray Perrault for their eomrael~te on earlier drafts, and to Roger Evans, Gerald Gszdsr~ Ivan S~.$t ltenry Thompson, and members of the Foundations of Grammar project at the Center for the Study of Language and Information for their helpful discussions during the development of this work. in a unification-b~qed formalism, the PATR-II formalism developed at SRI International (henceforth PATR) [5], which h~s its own declarative semmltics, and which can therefore be viewed &s an axiomatization of string well-formedness constraints. 2 The characterization of GPSG thus obtained is simpler and better defined than the version described by Gazdar et al. The semantics of the formalism is given directly through the reduction to PATR. Also, the PATR axiomatization has a clear construetire interpretation, unlike that used in Gazdar et al., thus making the system more amenable to computational implementation. Finally, the characteristics of the coml~ilation--the difficulty or ease with which the various devices can be encoded in PATR-can provide a measure of the expressiveness and indispensability of these devices in GPSG. 2 T h e G P S G A x i o m s 2 .1 A S u m m a r y o f t h e P r i n c i p l e s GPSG describes natural languages in terms of various types of constraints on local sets of nodes in trees. Pcrtlncnt to the ensuing discussion are the following: • ID (immediate dominance) rules, which state constraints of immediate dominance among categories; • metarules, which state generalizations coI~ccraing classes of ID rules; ¢ LP (linear precedence) rules, which constrain the Ihwar order of sibling categories; • feature cooccurrencc restrictions (FCR), which constrain the feature structures as to which arc permissiHe categories; a feature specification defaults (FS1)), which provide values for features that are otherwise unspecified; and, most importantly, 21towever, a caveat is ]n order th:~t the detailed ~u~alysis from this perspective of the full range of GPSG devices (especially immediate dominance (ID) rules, and feature cooccurrence restrictions) is not discussed fillly here, nor do I completely understand them. (See Section 3.4.} And while in a confessional mood, I should add that the Msorlthm given here has not actually been implemented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Defining Natural Language Grammars in GPSG

1 Overview Three central goals of work in the generalized phrase structure grammar (GPSG) linguistic framework, as stated in the leading book "Generalized Phrase Structure Grammar" Gaz-dar et al (1985) (hereafter GKPS), are: (1) to characterize all and only the natural language grammars, (2) to algorithmically determine membership and generative power consequences of GPSGs, and (3) to embody th...

متن کامل

Crossed Serial Dependencies: A low-power parseable extension to GPSG

An extension to the GPSG grammatical formalism is proposed, allowing non-terminals to consist of finite sequences of category labels, and allowing schematic variables to range over such sequences. The extension is shown to be sufficient to provide a strongly adequate grammar for crossed serial dependencies, as found in e.g. Dutch subordinate clauses. The structures induced for such construction...

متن کامل

A Unification-based Approach to Mandarin Questions

This paper providers unification-based GPSG and LFG analyses of Mandarin questions. First, we briefly introduce four kinds of Mandarin question, namely, WH-questions, A-not-A questions, disjunctive questions, and particle questions. Their different interrogative messages are adequately encoded with different feature-value pairs. Then, the compatibility of this interrogative information in a sim...

متن کامل

Computational Complexity of Current GPSG Theory

An important goal of computational linguistics has been to use linguistic theory to guide the construction of computationally efficient real-world natural language processing systems. At first glance, generalized phrase structure grammar (GPSG) appears to be a blessing on two counts. First, the precise formalisms of GPSG might be a direct and fransparent guide for parser design and implementati...

متن کامل

Effective Parsing With Generalised Phrase Structure Grammar

Generalised phrase structure grammars (GPSG's) appear to offer a means by which the syntactic properties of natural languages may be very concisely described. The main reason for this is that the GPSG framework allows you to state a variety of meta-grammatical rules which generate new rules from old ones, so that you can specify rules with a wide variety of realisations via a very small number ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1986